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SUMMARY 


Replication and transcription of influenza virus 
genome mainly depend on its RNA-dependent RNA 
polymerase (RdRP), composed of the PA, PB1, and 
PB2 subunits. Although extensively studied, the un- 
derlying mechanism of the RdRP complex is still 
unclear. Here we report the biochemical character- 
ization of influenza RdRP subcomplex comprising 
PA, PB1, and N terminus of PB2, which exist as dimer 
in solution and can assemble into a tetramer state, 
regulated by vRNA promoter. Using single-particle 
cryo-electron microscopy, we have reconstructed 
the RdRP tetramer complex at 4.3 A, highlighting 
the assembly and interfaces between monomers 
within the tetrameric structure. The individual RARP 
subcomplex contains all the characterized motifs 
and appears as a cage-like structure. High-through- 
put mutagenesis profiling revealed that residues 
involved in the oligomer state formation are critical 
for viral life cycle. Our results lay a solid base for un- 
derstanding the mechanism of replication of influ- 
enza and other negative-stranded RNA viruses. 


INTRODUCTION 


The influenza virus is a negative-sense, single-stranded RNA vi- 
rus belonging to Orthomyxoviridae family whose pathogenicity 
has brought significant long-term threat to human health. Influ- 
enza virus contains eight segmented RNAs as its genome, which 
can synthesize at least 16 proteins. The influenza RNA-depen- 
dent RNA polymerase (RdRP), a heterotrimer with a mass of 
about 250 kDa, is composed of three subunits, namely PA, 


® CrossMark 


PB1, and PB2. RdRP plays a central role in replication and 
transcription of the viral RNAs (vRNAs) (Fodor, 2013). It is 
responsible for both viral replication and transcription, which 
can produce three different types of RNAs —vRNA, complemen- 
tary RNA (cRNA), and messenger RNA (mRNA). In the virus, influ- 
enza RdRP subunits are bound to the 5’ and 3’ ends of the viral 
genome RNAs and are further wrapped with nucleoproteins to 
form viral ribonucleoproteins (VRNP and cRNP) (Tiley et al., 
1994). The 5’ and 3’ ends of genomic vRNAs or cRNAs act as 
promoter during RNA synthesis and remain partially paired, 
resembling a corkscrew structure (Flick et al., 1996; Tomescu 
et al., 2014). Binding of RdRP to the promoter RNA is crucial 
for both transcription and replication, in which the whole viral 
RNPs act as template. 

To fulfill viral replication and transcription, the three subunits 
of the influenza RdRP concisely cooperate with each other. A 
cis-acting model proposed that only one RdRP is involved during 
the transcriptional process (Fodor et al., 1994; Hagen et al., 
1994; Luytjes et al., 1989), whereby pre-mRNAs from host cells 
are snatched by the 5’-cap binding domain of PB2 and cleaved 
to form a 10-13 nucleotide (nt) primer by the endonuclease activ- 
ity harbored by the N terminus of PA (PA_N) (Dias et al., 2009; 
Fechter et al., 2003; Guilligay et al., 2008; Hara et al., 2006; 
Nakagawa et al., 1995; Plotch et al., 1981; Yuan et al., 2009). 
The 5/-cap-containing primer is then involved in elongation of 
mRNA along vRNA genome by the PB1 subunit, which contains 
conserved RdRP motifs; transcriptase stops at the polyU posi- 
tion, 17-22 bases away from the 5’ end of vRNA (Fodor et al., 
1994; Hagen et al., 1994; Robertson et al., 1981). 

Influenza virus replication is distinct from transcription in that 
initiation can take place in the absence of primer to produce a 
full copy of the vRNA genome through a cRNA synthesis inter- 
mediate step (de novo synthesis without adding 5’-cap struc- 
tures and polyA tails) (Deng et al., 2006a, 2006b; Zhang et al., 
2010). Furthermore, in contrast to transcription, more than one 
RdRP complex has been reported to coordinate in trans during 
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viral replication (Jorba et al., 2009; Moeller et al., 2012). Electron 
microscopy (EM) studies suggested that apart from the RdRP 
located at the termini of RNP structure, a new RdRP is recruited 
during synthesis of corresponding RNAs (Moeller et al., 2012) 
which is proposed to fulfill genome RNA replication process 
(Jorba et al., 2008). 

In contrast to double-stranded and positive-stranded viruses 
(Ng et al., 2008), the functional mechanisms underlying RdRP 
from negative-stranded RNA viruses are still elusive, probably 
due to complexity in the structural organization of RARP. Low- 
resolution EM structures of viral RNP showed that the RdRP 
has an overall cage-like structure and probably undergoes 
conformational changes to fulfill its multiple functions (Area 
et al., 2004; Arranz et al., 2012; Moeller et al., 2012; Torreira 
et al., 2007). Previously reported crystal structures of PA_C (res- 
idues 256-716)/PB1 (residues 1-25) and PB1 (residues 680- 
759)/PB2 (residues 1-37) complexes suggest that the three 
subunits bind to each other in a head-to-tail model in the order 
of PA-PB1-PB2 (Fodor, 2013; He et al., 2008; Obayashi et al., 
2008; Ohtsu et al., 2002; Sugiyama et al., 2009). Structures of 
the cap-binding domain and C terminus of PB2 have been deter- 
mined (Guilligay et al., 2008; Kuzuhara et al., 2009). But it is still 
unclear how this exquisite replication machine is assembled to 
fulfill its diverse functions. It will be of particular interest to 
know how the cis and trans conformations during transcription 
and replication processes are regulated and how a new RdRP 
is recruited at the beginning of replication. An integral and 
detailed structure of influenza RARP is essential for understand- 
ing the assembly, oligomerization, and versatile functions medi- 
ated by the complex. 

For determination of the structural assembly of influenza 
RdRP, we purified an RARP subcomplex (termed as subcomplex 
I) of influenza type A virus comprising full-length PA and PB1 
subunits and the N-terminal 130 amino acids fragment of PB2 
(PB2_N). The complex was reconstructed at 4.3 A resolution 
by cryo-electron microscopy (cryo-EM) single-particle method, 
which clearly demonstrates the relative position of PA_C, PB1, 
and PB2_N in a tetramer state. Further biochemical and func- 
tional analysis suggests a dual RdRP-involved RNA synthesis 
model for the replication of influenza virus. Our data provide a 
structural basis for understanding the assembly and regulatory 
mechanism of influenza RdRP replication, and pave the way 
for exploring functional mechanisms of other negative-strand 
RNA viruses, for instance, Ebola virus, which has recently spread 
at an alarming rate and gained pandemic status in West Africa. 


RESULTS 


Assembly and Oligomeric State of Influenza RdRP 

The influenza A virus (strain A/goose/Guangdong/1/96/H5N1) 
RdRP was expressed in Trichop /usiani (BTI-Tn5B 1-4, Hi5) insect 
cell line using Bac-to-Bac baculovirus expression system (Invi- 
trogen Inc.). Initial attempts to purify the full-length protein com- 
plex proved to be challenging due to repeated degradation of 
PB2 subunit at the C terminus, leaving only about 130 residues 
at the N terminus (confirmed through MS spectrometry and pro- 
tein sequencing). We therefore re-engineered the subcomplex l, 
which appeared as an ~200 kDa monomeric heterotrimer. The 
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subcomplex | existed as dimer in solution with an apparent mo- 
lecular weight of about 400 kDa, as evident through size-exclu- 
sion chromatography (Figure 1A). Upon incubation with either 
vRNA or cRNA promoter, the dimer spontaneously assembled 
into a tetrameric state (Figures 1A and 2A). Existence of the 
tetrameric RdRP conformation was found to be consistent in 
all tested strains of influenza virus, including strain A/Brevig 
Mission/1/1918(H1N1), A/PR/8/34(H1N1), A/Victoria/3/1975 
(H3N2), and A/goose/Guangdong/1/96 (H5N1) (all the RARP 
subcomplexes lacked the C-terminal residues of PB2; data not 
shown). Furthermore, subcomplex | could also oligomerize into 
a tetramer state in the presence of single-stranded 5’ arm of pro- 
moter vRNA or cRNA (Figures 1C and 1D, blue lines), suggesting 
that tetramerization is dependent on the presence of 5’ arm of 
viral promoter RNA. However, upon addition of single-stranded 
3’ arm of promoter RNA (vRNA or cRNA) to the RNA-free poly- 
merase, a significant portion of the dimer dissociated into mo- 
nomers (Figures 1C and 1D, magenta lines). Based on these 
results, we hypothesize that the oligomerization state of influ- 
enza RdRP is regulated by binding of RNA promoter. To analyze 
the dimerization interface, we made a series of truncations and 
subsequently identified another subcomplex (named as sub- 
complex Il) including full-length PA and PB1 subunits along 
with the N-terminal 86 residues of PB2. Subcomplex II failed to 
dimerize and existed only as monomer in solution, with or 
without viral promoter RNA (Figure 1B). These results indicate 
that residues 86-130 of PB2 are crucial for oligomerization of 
influenza RdRP. 


Architecture of RARP Subcomplex | 

Using state-of-the-art cryo-EM instrument of a 300 kV FEG 
transmission electron microscope equipped with a direct elec- 
tron-counting camera, we performed cryo-EM analysis of sub- 
complex | in the presence of vRNA promoter. An overall 6.8 A 
resolution cryo-EM map for the complex was calculated (see 
Supplemental Experimental Procedures). Consistent with the 
elution profile from size-exclusion chromatography, we obtained 
a tetrameric structure resembling a flat square with each mono- 
mer at the corners following D2 symmetry —two complexes at di- 
agonal positions facing up and the other two facing down (Fig- 
ures 2 and 5, and see Figures S1 and S2 online). Overall 
dimension of this square tetrameric structure is about 180 x 
150 x 70 A? with a central empty space of 90 x 45 x 70 A®. 
Two types of monomer interaction exist at the interface—one 
is more rigid, and the other is relatively flexible (Figures 2, 5, 
S1G, and S2B). We therefore could distinguish the two dimers 
in the tetramer assembly. To further improve the resolution, a 
soft mask including the rigid dimer within the tetramer (Fig- 
ure S2C) was applied, and iterative 3D refinement of the same 
data set was performed. The process dramatically increased 
resolution of the dimer to a final 4.3 A (Figures 2B and S2). The 
near-atomic resolution of the electron density map enabled clear 
identification of major secondary structural elements with addi- 
tional bulky side chains (Figures 2C and S3; Movie S1). 


Atomic Model of the RARP Complex 
From the high-resolution 3D reconstruction, we were able to 
isolate the subcomplex | monomer that revealed as a cage-like 
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Figure 1. Oligomer States of the Purified Subcomplexes 
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(A) Size-exclusion chromatography and SDS-PAGE analysis of subcomplex |. The UV absorptions at 280 nm and 260 nm are shown as solid and dashed lines, 
respectively. The elution volumes of RNA-free (in black) and RNA-bound (in blue) samples are 58 ml and 53 ml, respectively, suggesting dimer and tetramer 


formation. 


(B) Size-exclusion chromatography of subcomplex Il, similar to Figure 1A. Both RNA-free (in black) and RNA bound samples (in blue) are eluted at 63 ml with an 
apparent molecular weight of 200 kDa, suggesting monomeric state of the complex. The bands for PB1 and PB2 subunits are indicated in the SDS-PAGE analysis 
result shown on the right panel, while the small fragment (residues 1-86) from PB2 is invisible in the gel. 

(C and D) Oligomeric state transition of subcomplex | induced by 5’ or 3’ arms of promoter. The result of RNA-free sample is shown in black. Results for the 5’ arm 
(in blue) or 3’ arm (in magenta) of VRNA and cRNA promoter binding to subcomplex | are shown in (C) and (D), respectively. 


structure with dimensions of 90 x 75 x 70 A® (Figure 2C), similar 
to previous cryo-EM results as well as with RdRP structure of 
reovirus (RV) (Area et al., 2004; Tao et al., 2002; Torreira et al., 
2007). 

Initially, we docked the structure of PA_C (PDB ID code 3CM8) 
into the EM map (He et al., 2008) (see Supplemental Experi- 
mental Procedures). The positively charged groove, initially iden- 
tified in PA_C, is located toward the cavity. All the helices and 
most of the strands belonging to PA_C domain could be well- 
fitted onto the map (Figures 2C, 3A, and S5B); some additional 
side chains in this domain were also visible (Figures S3A-S3F). 
The PB1 model was built de novo by combining the 4.3 A reso- 
lution cryo-EM density map and bioinformatics approaches such 
as sequence alignment of the characterized RdRP motifs, sec- 
ondary structure prediction, and topology analysis of some 
known RdFPs of viruses (Figure S4; Supplemental Experimental 
Procedures) (Andino et al., 1993; Butcher et al., 2001; Ferrer- 


Orta et al., 2004; Tao et al., 2002; Zamyatkin et al., 2008). The 
final PB1 model was built with polyalanine which contained 21 
helices, covering most of the predicted helices in N-terminal 
620 amino acids of PB1 with some interruptions in some regions 
(Figures 3A, 3B, S4B, and S5A). After building the core region of 
PB1 subunit, we assigned the densities, located besides the 
thumb domain of PB1, to be the C-terminal part (PB1_C, resi- 
dues 620-757) and PB2_N. Due to low resolution at this region, 
we could only build six additional helices to depict the model 
(Figures 2C, 3A, and S5B; Movie S1). 

The structure of PB1 resembles a ring-like structure (Figures 
3C and S5A). Briefly, PB1_N peptide (residues 1-16) is bound 
by the mouth region of PA_C. The following peptide traverses 
across the inner surface of the cavity and goes down toward 
the finger domain. The peptide, covering residues 40-480, folds 
into the palm (top) and finger domains (bottom), respectively 
(right part of Figure 3A). The peptide chain (residues 480-620) 
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Figure 2. Cryo-EM Map of Subcomplex | 

(A) Representative 2D class averages of tetrameric subcomplex | cryo-EM 
images. 

(B) 3D map of the tetrameric subcomplex | at 4.3 A resolution, depicting a 
square structure with four monomers. The model is a composition of two 3D 
reconstructions of the dimer portion within a tetramer. The three orthogonal 
views are all along the apparent 2-fold symmetry of each view. For clarity, in 
the two side views only the front dimer is shown. 

(C) Atomic models of the PA_C (purple), PB1 (yellow), unassigned region 
(cyan), and promoter RNA (two strands in green and brown, respectively) 
docked in the 3D map of a monomer at the upper-left corner of the tetramer 
model in (B). 

See also Figures S1 and S2 and Movie S1. 


goes from the palm region to the upper part of PA_C, in which 
the C-terminal thumb domain is further folded (Figures S4B 
and S5A). 

Further search for homology using Dali server (http://ekhidna. 
biocenter.helsinki.fi/dali_server/start) showed that the structure 
of PB1 shares high similarity to those known RdRPs, such as 
hepatitis C virus (HCV) (HS5B, PDB ID code 2BRk), polio virus 
(PV) (PDB ID code 30L6), and foot and mouth disease virus 
(FMDV) (PDB ID code 3KOA) with a normal root-mean-square 
deviation (rmsd) of around 3.5-4 A (Ago et al., 1999; Andino 
etal., 1993; Bressanelli et al., 1999; Lesburg et al., 1999) (Figures 
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4A and S5C). Most of the secondary structure elements of these 
proteins could be aligned, including locations of the character- 
ized RdRP motifs. 

To sum up, the PA_C domain is located on the left part of the 
subcomplex structure, while the finger and palm domain of PB1 
is located on the right part. The thumb domain of PB1 is seated 
on the top of the PA_C molecule (Figure 3A). 


Cage Cavity Formed by PA_C and PB1 for RNA Binding 

The assembly of PA_C and PB1 creates a partially enclosed empty 
central cavity, within which a significant but unassigned density 
bearing the shape of a short bending sheet is located at the bottom 
of the cavity between PB1 finger domain and PA_C brain region 
(Figures 2C and S3G-S3)). As reported earlier, the VRNA promoter 
has been predicted to contain an unpaired 9-10 nt region followed 
by a 5-6 base-paired region (Tomescu et al., 2014). The bending 
sheet-like density could be well-modeled with a 6 base-paired 
double-stranded RNA structure (Figures 3A and S3G). We pro- 
pose this to be the paired promoter RNA. Furthermore, a potential 
density for one arm of the unpaired vRNA chains could be located 
toward the inner surface close to the semicircular positively 
charged groove and neck region of PA_C (Figure S3H). The den- 
sities for the other arm of the promoter could not be well-defined, 
although we detected a discontinued unassigned density 
following the duplex RNA region, extending toward the bottom 
of PB1 finger domain (Figure S3l). The low resolution of the density 
map may result from the potential flexibility of this RNA chain. 

Some vRNA binding regions in the RARP complex have been 
reported previously (Fodor et al., 1993; Gonzalez and Ortin, 
1999; Li et al., 1998). Residues 1-83, 233-256, 571-572, and 
493-759 in PB1 were implicated to be involved in vRNA pro- 
moter binding. Therefore, the cavity formed by PA_C and PB1 
can be presumed to be the site for RNA binding and synthesis. 
PA_C and the thumb domain provide the left side and upper sur- 
face of the cavity, while PB1 finger and palm domain constitute 
the right side. Furthermore, a few positively charged residues 
in the semicircular groove of PA, including Lys539, Arg566, 
and Lys574, were found to be highly conserved, and their muta- 
tions disrupted all three types of RNA synthesis activity as well as 
ability for promoter RNA binding by the RdRP complex (Figures 
3D and S5E-S5H). A loop between £6 and 87 in PA_C including 
residue His510 is also located in this region. 

Further detailed analysis of PB1/PA_C interaction showed that 
helices «5, «6, «7, and «8 on PA_C provide the major elongated 
platform to interact with «17 and «19 on the thumb domain of 
PB1. Specifically, residues Glu427, lle428, Arg442, Arg443, 
R582, Leu586, Gin590, Gin591, and Glu593 in PA_C were 
observed to interact with the thumb domain of PB1 (Figure S5D). 
It has been reported that Arg442 and Arg443 are essential during 
replication (Obayashi et al., 2008). Involvement of charged resi- 
dues at this interface implicates that the interaction might be 
regulated, presumably by vRNA binding. In addition, the outer 
surface of PB1 structure is surrounded by the long linker be- 
tween PA_N and PA_C (residues 200-255) (Figures 3A and 
S5B), consistent with previous results showing that this linker 
necessarily binds to PB1 (Guu et al., 2008). The large interaction 
area between PA and PB1 indicates a strong complex formation 
between the two subunits. 


1əßul4 


Structure Comparison of PB1 with PV RdRP and 
Implications for RNA Synthesis 

Structural similarities in the catalytic domain of influenza PB1 
with HCV and PV RdFPs (Figures 4A and S5C) suggested that 
these viruses may use similar mechanisms for RNA synthesis. 
We further superimposed the PB1 catalytic domain with the PV 
RdRP elongation complex bound to three RNA chains (PDB ID 
code 30L6) (Andino et al., 1993). An 18 nt template RNA and a 
14 nt nascent RNA could be docked into the cavity of the sub- 
complex without significant conflict with surrounding residues, 
while the three-residue cRNA paired to the 5’ terminus of the 
template remained outside the cavity (Figure 4B). The proposed 
RdRP motifs Pre-A, A, B, and C in PB1 are closely in contact with 
the RNA template as well as the product (Figure 4C). The 3’ end 
of the product is located at the center of these motifs, closely 
pointing toward the conserved S-D-D loop, for addition of new 
nucleotide. The secondary structure elements constituting the 
inner surface of this central cavity in the PA/PB1 structure are 
largely involved in RNA binding. Based on this comparison, we 
propose that the influenza RdRP might bear an RNA synthesis 
pathway similar to that for PV. Three gates are proposed for 
the influenza RNA synthesis activity. The gate for RNA template 
entry is predicted to be at the bottom of the cage cavity, while 
exit gate for the product would be in front of the structure (facing 
toward readers as in Figure 4A). The NTP entry gate provided by 
motif D would be located in the rear of the structure (te Velthuis, 
2014) (Figures 4A and 4B). 


Formation of Dimer and Tetramer RdRP Subcomplex 
In the tetramer structure (Figures 5A and 5B), two types of mono- 
mer interactions are visible—one interaction within a stable 


Figure 3. Overall Structure of the Subcom- 
plex I Structure 

(A and B) Different views of the structure PA_C 
(purple) and PB1 catalytic domain (yellow). The 
additional unassigned helixes are shown in cyan. 
The RNA duplex bound to the RdRP subcomplex | 
is shown in green and brown. Different regions of 
the complexes are labeled or indicated using 
black arrow in the figure. 

(C) Overall structure of PB1 subunit. The palm, 
finger, and thumb domains of PB1 have been 
labeled in the figure. The characterized catalytic 
motifs, namely Pre-A, A, B, C, and D, are shown in 
red, blue, green, cyan, and purple, respectively. 
(D) The residues on PA_C domain predicted to be 
critical for RNA binding are labeled and shown as 
sticks. B6 and B7 in PA_C are shown in green. The 
neck loops are indicated using black arrow. 

See also Figures S3-S5. 


dimer (hereafter termed interface 1, be- 
tween monomers 1 and 2) is mediated 
through brain regions of adjacent PA_C 
domains and the PB1_C/PB2_N part 
containing the thumb domain of PB1 (Fig- 
ure 5C). Residues 293-355 containing 
helices «1 and «2 and a loop 350-NEE- 
KIP-355 (L350-355) between «2 and a3 in PA_C contribute to 
most of the interaction within this region. Particularly, L350- 
355 loop of monomers 1 and 2 are located antiparallel to each 
other and may form multiple interactions (Figure 5C). Two heli- 
ces, H1 and H2 in the PB1_C/PB2_N region, are also present 
at the interface of the two monomers (Figure 5C). They are pre- 
sumed to cover residues 86-130 of PB2 subunit, based on our 
results from size-exclusion chromatography which showed 
that subcomplex II existed only as monomer, even in the pres- 
ence of bound vRNA promoter (Figure 1B). Indeed, secondary 
structure prediction of PB2 performed by PSIPred (hittp:// 
bioinf.cs.ucl.ac.uk/psipred/) shows that this region (residues 
86-130) might contain two helices. 

The second interaction (hereafter termed as interface 2, be- 
tween monomers 1 and 3) is weaker and more flexible than the 
first interaction (Figures 5D and S2B). The finger domains of 
PB1 are major regions involved in the interface. However, due 
to incomplete model and densities at this flexible finger region, 
the interface between the two monomers could not be well-char- 
acterized. Two loops between residues 355-375 and 180-205 of 
PB1 are likely to be involved in the interface (Figure 5D). It is 
therefore plausible for the RNA to induce a conformational 
change in the dimer state favoring sideway interaction to form 
a tetramer. Moreover, the relatively lower resolution for residues 
located at the interfaces 1 and 2 of the tetramer complex further 
suggests that these interfaces are flexible and the oligomer 
conformation might be regulated. 

In addition, from 3D reconstruction of the tetrameric com- 
plex, we could reconstruct four double-stranded RNA helices 
(Figure 5B). Each double-stranded RNA helix is bound to 
and lies at the bottom of one subcomplex | monomer. The 
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Figure 4. Structure Comparison of PB1 Catalytic Domain with PV RdRP and RNA Binding Model for the Subcomplex | 
(A) Structure comparison of PB1 catalytic domain with PV RdRP (PDB ID code 30L6). PB1 is colored in yellow and PV RdRP is in gray. The RARP domains are 


labeled. 


(B) RNA binding model of the influenza virus RARP complex with vRNAs derived from structural comparison with poliovirus RARP-RNA complex. The color 
strategy is same as in Figure 3A. Among the three RNA chains from 30L6 structure, 3’ template chain is colored in orange, nascent RNA in green, and 5’ 
complementary RNA in blue. The entry and exit pathways of the product as well as the NTP gate are indicated by black arrows. 

(C) The detailed structure representing interaction of the docked RNAs chains with conserved motifs of PB1. The color strategy of the RdRP motifs is the same as 


in Figure 3C. 


location and lying conformation of each double-helix RNA sug- 
gested that it is possible for the two RNA chains (from one 
helix) to adjust their orientations for binding to its adjacent 
subcomplex. 


Mutagenesis Analysis on the Residues Located on the 
Dimer Interfaces 

In order to evaluate whether the secondary structure elements in 
subcomplex | are essential for viral replication, a high-throughput 
mutagenesis profiling of the residues, presumed to be involved 
in dimer and tetramer formation, was performed as described 
previously (Arumugaswami et al., 2008). Briefly, to produce a 
high-density mutant library with >10° mutant clones, random in- 
sertions of a 15 nt sequence (5/-TGCGGCCGCANNNNN-3’, N 
could be any nucleotide) to the plasmids encoding PB1, PB2, 
and PA were generated. Insertion of this 15 nt sequence into 
any position within the three subunits resulted changes in five 
to seven amino acids, without introducing any stop codon or 
frameshift mutation. Every residue, in these selected fragments 
of the three subunits, has been mutated at least once by this 
method as confirmed through sequencing of the clones (see 
Experimental Procedures). The surviving viral clones (replica- 
tion-competent) were selected and sequenced. 

From the plethora of data, we mainly focused on the investiga- 
tion of two regions—interface 1 (within the rigid dimer) and inter- 
face 2 (within the flexible dimer). Most of the mutations in resi- 
dues 293-350 of PA_C failed to yield viable clones (Figure 6C). 
Particularly, no viable clone with insertion mutations in the resi- 
dues L350-355 of PA_C could be observed (Figures 5C and 
6C). In the tetrameric structure of subcomplex I, a short fragment 
containing residues 86-130 of PB2_N was found to be essential 
for the formation of RNA-free dimer between monomers 1 and 2 
(Figure 5C), as discussed earlier (Figure 1B). Consistent with 
these structural and biochemical results, no viable clone con- 
taining mutations in this region was found during our mutagen- 
esis study (Figure 6C). On the contrary, clones with mutations 
at positions 53, 54, 55, 57, 60, 61, and 63 were found to be largely 
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viable. Taken together, these results suggest that residues 86- 
130 are important for replication of the influenza virus. 

Two loops in the finger domain of PB1, between residues 355- 
375 and 180-205, are proposed to be involved in interface 2. 
Similarly, only a few viable clones were observed with mutations 
in this region (Figure 6C). Some single-point mutations were also 
investigated using fluorescence CAT method, as described in 
the Experimental Procedures. lle308 is located on helix a2 of 
PA_C, at interface 1. Mutation of residue Ile308 of PA_C signifi- 
cantly decreased polymerase activity to less than 7% in a mini- 
replicon assay. Another three single mutations (GIn367, Met372, 
and Leu373) in the fragment containing residues 355-375 of PB1 
resulted in loss of polymerase activity (Figure 6A). These three 
residues, located at the bottom of the finger domain and far 
away from the catalytic center, are presumed to be involved in 
interface 2 (Figure 6B). These results further implicate the signif- 
icance of oligomerization of influenza RdRP. 


DISCUSSION 


Cryo-EM method has recently gained significant improvement 
in atomic structure reconstruction. Our results also represent 
significant progress in determination of the influenza RdRP 
structure. High structural similarity of influenza RdRP with other 
viral RdRPs strongly suggests that influenza virus may share 
some similar mechanisms for RNA synthesis. However, in 
contrast to known viral RdRPs, influenza RdRP exists in 
different oligomerization states which might predict the possi- 
bility of a different mechanism during RNA synthesis and as- 
sembly (Jorba et al., 2008, 2009; Moeller et al., 2012). Based 
on our structure and previous reports, some mechanisms 
underlying replication of influenza RdRP could be further 
discussed. 


Oligomeric State Transition Regulated by RNA Binding 
Purification of full-length influenza RdRP complex from mam- 
malian cells (HEK293T), in the form of dimer and higher 


Figure 5. Conformation of the Dimer and Tetramer 
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(A) Docking of the subcomplex | atomic models in the tetramer density map (in different views). The color strategy of the subcomplex complex is the same as in 


Figure 3A. 


(B) Overall structure of the tetrameric subcomplex. For a clear view of the four subcomplexes, the PA and PB1 subunits along with the unassigned region in 
monomer 2 and 3 are shown in pink, brown, and green, respectively. Color strategy of monomer 1 and 4 is same as in Figure 3. 
(C) Close-up views of the interface 1. Helices «1 and «2 in PA_C are shown in red color. Helices H1 and H2 in PB2 are colored in blue. Sequence 350-NEEKIP-355 


(L350-355) in PA_C is indicated by black arrow. 


(D) Close-up views of the densities for interface 2. Density for loop between residues 355 and 370 is highlighted in cyan. Density for loop residues 180-205 is 


highlighted in orange. 
See also Figure S6. 


oligomer states, has been reported previously (Jorba et al., 
2008). In addition, a soluble-free RARP complex was proposed 
to be recruited by the RNP-bound RdRP to initiate and fulfill the 
replication process (replication in trans) (Jorba et al., 2009). 
Consistent with the previous reports, our results suggest that 
subcomplex | can undergo oligomeric state transition, regu- 
lated by vRNA promoter binding. Therefore, instead of being 
an artifact from truncation of the C-terminal part of PB2, oligo- 
merization might be a necessary mechanism which reflects the 
physiological states of the full-length influenza RdRP complex. 
We propose that the dimer interfaces (1 and 2) would likely aid 
in the recruitment of another soluble RdRP to the RNP-bound 
RdRP for initiation of replication. This conclusion has been 
further substantiated from a high-throughput mutagenesis 
profiling which revealed that residues of the structural elements 
involved in formation of the dimer interfaces are critical for viral 
viability (Figure 6). 


Proposed Mechanisms for Influenza Replication 

and Transcription 

We further considered the physiological relevance for existence 
of dimer and tetramer states during replication and transcription 
of the influenza RdRP, based on the cryo-EM structure of RARP 
subcomplex |. Jorba et al. reported an RNP purification strategy 
using a transcription-defective PB2 without affinity tag (short- 
ened as “T”) or a replication-defective PB2 with C-terminal 
His-tag (shortened as “R-His”) (Jorba et al., 2009). Based on 
their results, the authors presumed that the RNP-bound influ- 
enza RARP monomer executes mRNA synthesis process (cis- 
conformation), while a new RdRP monomer is recruited to the 
RNP to initiate replication (trans-conformation). However, this 
model could not address some key questions. Compared to 
wild-type RNP, the authors could purify up to 70%-80% tran- 
scription-defective progeny RNPs through nickel affinity column, 
when they are coexpressed with a replication-defective PB2 
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Figure 6. Effects of Mutations within Influ- 
enza RdRP Complex 

(A) Insertion mutations at specific positions of 
the RdRP complex dramatically reduced the RNA 
polymerase activity. Results came from an 
average of three independently analyzed biolog- 
ical replicates. Error bars indicate the standard 
deviation among them. 

(B) The locations of the specific residues respon- 
sible for RdRP activity reduction are near the 
interface of dimer and tetramer as labeled. 

(C) Summary for a high-throughput profiling of the 
residues, proposed to be located close to the 
interaction surface area. A table listing viable 
clones of the insertion mutants. The fragments 
shown in the table are proposed to be involved in 
dimerization. Residues selected in PA_C and PB2 


molecules are involved on interface 1. Residues 
selected in PB1 are proposed to be involved in the 


interaction on interface 2. Only survival clones are 


listed in the table. Residue numbers of the survival 


A 
2 é . 7 
2 e M3729 
Ps L373 E367. 
ao 
5 
oa 
D A D 
E a a oS oS 
Fr & 
R < g 
Cc 
Gene Fragment Survive clones 
PA | 293-355 | 306(1/2) | 343(2/5) 
er | 180-205 | 180(1/3) | 189(1/2) | 196(1/3) | 199(1/3) | 200(1/2) 
| 355-375 | 372(1/1) | 374(1/3) 
PB2 | 50-130 52(1/2) 53(1/3) 54(1/4) 56(1/2) 59(1/3) 60(1/3) 62(1/2) 


clones are shown (parenthetical digits indicate 


(R-His) (See Figure 1 in Jorba et al., 2009). These results contra- 
dict the fact that the progeny (consisting of viral genome RNA 
mimic, NP, PA, PB1, and transcription-defective PB2 [T]) did 
not have any affinity tag and therefore could not be purified 
through nickel affinity column. Theoretically, the purified progeny 
should consist of PB2 with His-tag (R-His). This is consistent with 
the results showing that coexpressed hybrid progeny vRNPs 
harbor transcription activity in the range of 20%-75% compared 
to wild-type vRNPs (see Figure 2 in Jorba et al., 2009). Therefore, 
we propose that both T and R-His polymerases might exist in the 
progeny RNP—the T polymerase takes charge of the replication 
activity in the progeny viruses, while the R-His polymerase har- 
bors the transcription activity (Figure SGA). 

Based on our results and a previous report (Jorba et al., 2009), 
we presume that instead of one RdRP, two RdRP complexes (or 
dimer) might be bound to a viral RNP. During replication, the 
RNP-bound RdRP dimer recruits another free dimer to form a 
transient tetramer during replication (trans-acting). The newly re- 
cruited dimer binds with nascent 5’ RNA and later dissociates 
from the parental RdRP to proceed with RNA elongation along 
the genome. Finally, a progeny RNP is generated and released 
from the parental RNP. In the case of transcription, the two 
RNP-bound RdRP monomers (in a dimer) might disassociate 
from each other through involvement of 3’ arm vRNA promoter. 
While one dimer (on the RNP) fulfills the requirement for promoter 
binding, the other monomer proceeds with transcription. As 
such, recruitment of additional soluble RdRP is not necessary 
for the transcription process. These presumptions from our 
structural studies correlate with the previous report (Jorba 
et al., 2009) and provide meaningful conclusions for the authors’ 
results. Due to the involvement of different oligomer states of 
influenza RARP, we also predict that the conformational change 
would be significant for transition between replication and tran- 
scription, which might influence the locations of PB2 and 
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survival ratio; for example, 1/2 implicated as two 
different clones are found on one residue with one 
survival clone). No survival clone within PB2 resi- 
dues 86-130 was found. 


PA_N. Further structural and functional studies with full-length 
RdRP subunits during transcription will assist in validating this 
hypothesis. 

During the process of submission of our manuscript, crystal 
structures for the RdRPs from type B influenza viruses and an 
unconventional bat influenza virus (H17N10) were reported 
(Pflug et al., 2014; Reich et al., 2014). These structures were 
obtained from full-length polymerase and were reported as 
monomer state in solution. The RNA synthesis and transcription 
mechanism of the influenza RdRP were discussed in these re- 
ports, which can be regarded as a breakthrough in structural 
study of influenza RdRP. However, the mechanism for replica- 
tion, as well as transition between replication and transcription 
processes, was only limitedly discussed. The overall structure 
of our cryo-EM reconstruction and the reported crystal struc- 
tures are precisely similar to each other. However, marked 
differences at the dimer interface between monomers 1 and 2 
(Figure S6B) could be observed. In the tetramer structure, the 
PB1_C/PB2_N domain is observed to be located far away from 
the cavity and extends toward the adjacent molecule on the 
dimer interface. We propose that our cryo-EM structure of the 
RdRP oligomer assembly might reflect the replication state of 
influenza life cycle, while the reported monomer RdRP confor- 
mations represent the transcription state. These differences 
might suggest the unique features specific to canonical influenza 
A RdRPs as well as a mechanism for conformational transition 
during transcription and replication. 

In summary, we reconstructed a 4.3 A resolution model of the 
oligomeric structure of the influenza polymerase by cryo-EM sin- 
gle-particle method. From these results, we propose that RNA 
elongation, processed by PB1 catalytic domain, should follow 
a classical pathway similar to that of other double-stranded or 
positive-stranded viruses. The tetramer structure further pro- 
vided evidence to understand the mechanism of oligomerization 


and possible conformational transition during viral replication 
and transcription. Our results provide a solid basis for further 
structural studies of mechanisms governing the functioning of 
RNA polymerase from influenza as well as other negative- 
stranded RNA viruses. 


EXPERIMENTAL PROCEDURES 


Protein Expression and Purification 

The genes encoding PA, PB1, and PB2 (residues 1-130 or 1-86, generating 
polymerase subcomplexes | and II, respectively) were cloned into pFastBac 
vector (Invitrogen). All the RNAs used in this report were synthesized by Takara 
Inc. The RdRPs were overexpressed in insect cell line (Hi5) using Bac-to-bac 
expression system (Invitrogen). The subcomplexes were purified through 
nickel affinity chromatography using 6-His tag at the C terminus of PB1. The 
proteins were further purified using ion-exchange chromatography (GE) to 
eliminate contamination. The oligomerization states of the purified polymerase 
complexes were determined through size-exclusion chromatography using a 
Superdex-200 (16/60 or 10/30) column (GE). 


EM Data Acquisition 

Grid Preparation and Data Collection 

Aliquots of 2.8 ul of purified tetrameric subcomplex | at a concentration of 
~0.2 mg/ml were applied to glow-discharged holey carbon grids (Quantifoil 
R2/1) and flash-cooled in liquid ethane using an FEI Vitrobot. Data were 
collected on an FEI Titan Krios operating at 300 kV. Images were recorded us- 
ing a K2 Summit direct electron counting camera (Gatan Inc.) at a calibrated 
magnification of 22,500 (pixel size of A), with the defocus ranging from —1.0 
to —3.0 um. All dose-fractionated cryo-EM images were recorded using 
UCSF-Image4, a semiautomated low-dose acquisition program (X. Li and Y. 
Cheng, personal communication). 

Image Processing 

The initial model was generated by tomographic subvolume averaging of the 
negatively stained specimen. Tomographic reconstruction of 3D volumes 
was performed by the weighted back-projection method using Xplore 3D soft- 
ware (FEI). The tomographic subvolumes of single particles were picked and 
extracted using the e2tomoboxer.py program of EMAN2 (Tang et al., 2007). 
These subvolumes were aligned and averaged using the Dynamo software 
package (Castafno-Diez et al., 2012). For cryo-EM micrograph, drift-corrected 
images were generated using the algorithm developed by Li et al. (Li et al., 
2013). Particle picking was performed using EMAN2 (Tang et al., 2007), 
contrast transfer function parameters were estimated using CTFFIND3 (Mind- 
ell and Grigorieff, 2003), and all 2D and 3D classifications and refinements 
were performed using RELION (Scheres, 2012). The resulting density maps 
were corrected for the modulation transfer function (MTF) of the detector 
and sharpened using postprocessing options of RELION (Scheres, 2012). 
Local resolution variations were estimated using ResMap (Kucukelbir et al., 
2014). 


Model Building and Refinement 

The structure of PA_C (PDB ID code 3CM8) was docked into the map manually 
in Chimera (http://www.cgl.ucsf.edu/chimera/). The catalytic motifs and topol- 
ogy of the structure were recognized according to sequence and structure 
comparison with reovirus (1N1H) (residues 380-890), foot and mouth disease 
virus (PDB ID code 3KLV), norwalk virus (PDB ID code 3BSO), and hepatitis C 
virus RdRPs (PDB ID code 2BRK) (Figure S4). Secondary structure elements 
from PB1 were built with polyalanine in Coot (Emsley et al., 2010) according 
to the density map. The model was further refined against the map with stereo- 
chemical and secondary structure restraints using Phenix.real_space_refine 
(Adams et al., 2010). 


Mutagenesis, Sequence Analysis, and Screening of Influenza 

Mutant Clones 

Procedures for high-throughput mutagenesis profiling were performed as 
described previously (Arumugaswami et al., 2008). In brief, the PA, PB1, and 
PB2 gene segments of influenza virus A/WSN/33 underwent Mu transposon- 


mediated mutagenesis, creating randomly positioned gene insertions. The 
mutant libraries were cotransfected with the seven complementary WT plas- 
mids for viral packaging in HEK293T cells (Hoffmann et al., 2000). Virus was 
then collected and used to infect MDCK cells in three consecutive passages. 
vRNA from each passage was recovered, converted to cDNA, and genotyped 
by PCR amplification using gene-specific forward primers and an insertion- 
specific reverse primer. Individual mutant clones were randomly selected for 
sequencing and screened for viability and polymerase activity. Polymerase 
activity was assessed by cotransfection of the WSN PB1, PB2, PA, and NP 
plasmids with a Gaussia luciferase reporter construct. The MRNA, vRNA, 
and cRNA transcript levels were quantified using a strand-specific real-time 
RT-PCR method. Primer extension assays of vVRNAs and RNA binding assay 
were performed as previously described (Hara et al., 2006). Additional exper- 
imental procedures can be found in the Supplemental Information. 
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